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Non-Gaussian velocity distributions 

— The effect on virial mass estimates of galaxy groups 
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ABSTRACT 

We present a study of 9 galaxy groups with evidence for non-Gaussianity in their 
velocity distributions out to 4i?200- This sample is taken from 57 groups selected from 
the 2PIGG catalog of galaxy groups. Statistical analysis indicates that non-Gaussian 
groups have masses significantly higher than Gaussian groups. We also have found that 
all non-Gaussian systems seem to be composed of multiple velocity modes. Besides, 
our results indicate that multimodal groups should be considered as a set of individual 
units with their own properties. In particular, we have found that the mass distribution 
of such units are similar to that of Gaussian groups. Our results reinforce the idea of 
non-Gaussian systems as complex structures in the phase space, likely corresponding 
to secondary infall aggregations at a stage before virialization. The understanding of 
these objects is relevant for cosmological studies using groups and clusters through 
the mass function evolution. 
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1 INTRODUCTION 

Groups of galaxies contain most of galaxies in the Universe 
and are the link between individual galaxies and large-scale 
structures (e.g., Huchra & Geller 1982; Geller & Huchra 
1983; Nolthenius & White 1987; Ramella et al. 1989). The 
dissipationless evolution of these systems is dominated by 
gravity. Interactions over a relaxation time tend to distribute 
the velocities of the galaxy members in a Gaussian distri- 
bution (e.g. Bird & Beers 1993). Thus, a way to access the 
dynamical stage of galaxy groups is to study their velocity 
distributions. Evolved systems are supposed to have Gaus- 
sian velocity distributions, while those with deviations from 
normality are understood as less evolved systems. Hou et al. 
(2009) have examined three goodness-of-fit tests (Anderson- 
Darling, Kolmogorov and \ 2 tests) to find which statistical 
tool is best able to distinguish between relaxed and non- 
relaxed galaxy groups. Using Monte Carlo simulations and 
a sample of groups selected from the CNOC2, they found 
that the Anderson-Darling (AD) test is far more reliable at 
detecting real departures from normality in small samples. 
Their results show that Gaussian and non-Gaussian groups 
present distinct velocity dispersion profiles, suggesting that 
discrimination of groups according to their velocity distri- 
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butions may be a promising way to access the dynamics of 
galaxy systems. 

Recently, Ribeiro, Lopes & Trevisan (2010) extended up 
this kind of analysis to the outermost edge of groups to probe 
regions where galaxy systems might not be in dynamical 
equilibrium. They found significant segregation effects after 
splitting up the sample in Gaussian and non-Gaussian sys- 
tems. In the present work, we try to further understand the 
nature of non-Gaussian groups. In particular, we investigate 
the problem of mass estimation for this class of objects, and 
their behaviour in the phase space. The paper is organized as 
follows: in Section 2 we present data and methodology; Sec- 
tion 3 contains a statistical analysis with sample tests and 
multimodality diagnostics for non-Gaussian groups; finally, 
in Section 4 we summarize and discuss our findings. 



2 DATA AND METHODOLOGY 
2.1 2PIGG sample 

We use a subset of the 2PIGG catalog, corresponding to 
groups located in areas of at least 80% redshift coverage 
in 2dF data out to 10 times the radius of the systems, 
roughly estimated from the projected harmonic mean (Eke 
et al. 2004). The idea of working with such large areas is 
to probe the effect of secondary infall onto groups. Mem- 
bers and interlopers were redefined after the identification of 
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gaps in the redshift distribution according to the technique 
described by Lopes et al. (2009). Finally, a virial analysis 
is perfomed to estimate the groups' properties. See details 
in Lopes et al. (2009a); Ribeiro et al. (2009); and Ribeiro, 
Lopes & Trevisan (2010). We have classified the groups after 
applying the AD test to their galaxy velocity distributions 
(see Hou et al. 2009 for a good description of the test). This 
is done for different distances, producing the following ratios 
of non-Gaussian groups: 6% (R < I-R200), 9% (R < 2i?2oo), 
and 16% (R ^ 3i?200 and R ^ 4i?20o)- We assume this 
latter ratio (equivalent to 9 systems) as the correct if one 
desires to extending up the analysis to the regions where 
galaxy groups might not be in dynamical equilibrium. Ap- 
proximately 90% of all galaxies in the sample have distances 
^ 4i?2oo • This is the natural cutoff in space we have made in 
this work. Some properties of galaxy groups are presented in 
Table 1, where non-Gaussian groups are identified with an 
asterisk. Cosmology is defined by £l m = 0.3, £l\ = 0.7, and 
Ho = 100 h km s _1 Mpc _1 Distance-dependent quantities 
are calculated using h — 0.7. 



3 NON-GAUSSIANITY AND MASS 
3.1 The mass bias 

We consider the two sample statistical problem for the Gaus- 
sian (G) and NG (non-Gaussian) subsamples. We choose 
the mass resulting from the virial analysis as the property 
to illustrate the comparison between the subsamples. Both 
Kolmogorov-Smirnov (KS) and Cramer-von Mises (CvM) 
tests reject the hypothesis that the NG subsample is dis- 
tributed as the G subsample, with p-values 0.00001 and 
0.00029, respectively. For these tests, we have used 1000 
bootstrap replicas of each subsample to alleviate the small 
sample effect. The result indicates an inconsistency between 
the mass distributions of G and NG groups. This could rep- 
resent a real physical difference or, more probably, an indica- 
tion of a significant bias to higher masses in NG groups. The 
median mass for this subsample is (M^q ) = 2.57x 10 14 Mq, 
while it is (M200) = 8.85 x 10 13 Mq for the G subsample, 
thus (M200) is larger by a factor of ~ 2.9. In the follow- 
ing, we investigate this mass bias looking for features in the 
velocity distributions of galaxy groups. 



3.2 Exploring non-Gaussianity 

The shape of velocity distributions may reveal a signature 
of the dynamical stage of galaxy groups. For instance, sys- 
tems with heavier tails than predicted by a normal parent 
distribution may be contamined by interlopers. Otherwise, 
systems with lighter tails than a normal may be multimodal, 
consisting of overlapping distinct populations (e.g. Bird & 
Beers 1993). We now try to understand the mass bias in the 
NG subsample by studying non-Gaussianity in the velocity 
distributions. Since we have carefully removed interlopers 
from each field (see Section 2.1; and Lopes et al. 2009a), 
we consider here that the most probable cause of normal- 
ity deviations in our sample is due to a superposition of 
modes in the phase space (e.g. Diemand & Kuhlen 2008). 
Visual inspection of radial velocity histograms of NG sys- 



Table 1. Main properties of groups 



Group 


#200 (Mpc) 


M200 (10 14 M ) 


a (km/s) 


- 1 * memo 


N200 


55 


0.689 


0.400 


1 73 903 


16 


9 


60 


0.664 


0.359 


1 90 3Q^ 


41 


12 


84 


1.025 


1.326 


224.715 


54 


10 


91 


0.848 


0.754 


164.293 


34 


4 


102* 


1.550 


4.602 


433 91 8 
too. z _i_o 


32 


g 


130 


0.999 


1.245 


9A0 Q08 


43 


11 


138* 


1.271 


2.578 


348 88^ 

01:0.000 


74 


14 


139 


0.997 


1.250 


93^ 740 


39 


12 


169 


0.765 


0.573 


99^ 91 9 


9 


4 


177* 


1.178 


2.100 


33fi 30Q 
oou. ouy 


45 


14 


179* 


1.663 


5.897 


471.352 


23 


10 


1 &i 
101 


1 247 


9 A88 


QQQ Q7Q 

oyo.o < y 


9Q 
zy 


12 


188 


0.514 


0.174 


153.714 


g 


2 


191 


0.707 


0.456 


9fi8 ^1 7 


14 


10 


197 


0.897 


0.930 


9Q7 Q03 
Ziij 1 . yuo 


13 


7 


204 


1 AQ1 


A 333 


/IVQ CI Q 
4: I O.OLO 


33 
00 


12 


209 


0.680 


0.407 


1 39 flQ^ 

_i_ozr. uyu 


15 


4 


222 


1.243 


2.494 


31 ^ 3^ 
J. 0. 000 


22 


g 


236 


1.258 


2.575 


331.308 


19 


7 


971 


u.yuo 


1 0Q3 


940 ficn 


41 


1 ^ 

TO 


326 


0.648 


0.333 


167.764 


21 


7 


oco* 


1 fi^fi 

i.UOU 


^ fi1 8 




31 
1 


1 3 
TO 


353 


0.873 


0.826 


990 09^ 


34 


12 


374 


0.902 


0.919 


91 7 038 

Zr J_ f . UOO 


21 


10 


377 


0.480 


0.138 


1 39 Afifi 


13 


5 


387 


0.649 


0.345 


1 9^ 308 


32 


5 


398 


0.781 


0.601 


1 89 800 


29 


12 


399 


0.917 


0.973 


1 83 3Q7 
loo.oy 1 


34 


7 


409 


1.078 


1.583 


251.071 


64 


11 


410 


0.847 


0.769 


1 QQ A(\?\ 


42 


g 


428 


1.201 


2.191 


349 440 

Ot^ ■ 1 -LW 


26 


14 


435 


0.606 


0.283 


123.549 


23 


4 


444 


0.820 


0.697 


944 4Q1 


13 


7 


447* 


1.169 


2.032 


9Qfi Q1 Q 


37 


6 


453 


0.827 


0.720 


271.840 


17 


10 


455 


1.588 


5.088 


454.227 


65 


15 


456 


0.319 


0.041 


58.138 


16 


3 


458 


0.642 


0.336 


115.674 


18 


4 


466 


1.507 


4.381 


554.511 


23 


13 


471 


1.128 


1.832 


3QQ 078 


22 


14 


475* 


1.152 


1.952 


294.413 


28 


6 


479 


0.806 


0.671 


1 fiO 887 

1UU.OO 1 


17 


6 


480 


0.923 


1.006 


979 O'SS 


23 


12 


482 


0.663 


0.373 


909 Q03 
ZrUZr. yuo 


17 


9 


484 


1.099 


1.702 


268.116 


32 


7 


485 


1.679 


6.076 


559.140 


29 


18 


488 


1.220 


2.334 


326.566 


42 


12 


489 


0.872 


0.852 


188.283 


30 


6 


493 


1.170 


2.061 


323.268 


30 


10 


504 


1.319 


2.985 


429.207 


30 


13 


505* 


1.176 


2.111 


336.804 


20 


7 


507* 


1.459 


4.036 


436.498 


27 


11 


513 


0.724 


0.495 


178.637 


19 


6 


515 


1.117 


1.817 


298.220 


30 


13 


519 


0.808 


0.689 


207.011 


17 


4 


525 


0.495 


0.158 


108.230 


9 


3 


536 


2.003 


10.576 


676.902 


45 


24 



3 



tems suggests that multipeaks really happen in most cases 
(see Figure 1). 

We statistically check multimodality by assuming the 
velocity distributions as Gaussian mixtures with unknown 
number of components. We use the Dirichlet Process mix- 
ture (DPM) model to study the velocity distributions. The 
DPM model is a Bayesian nonparametric methodology that 
relies on Markov Chain Monte Carlo (MCMC) simulations 
for exploring mixture models with an unknown number of 
components (Diebolt & Robert 1994). It was first formalised 
in Ferguson (1973) for general Bayesian statistical modeling. 
The DPM is a distribution over fc-dimensional discrete dis- 
tributions, so each draw from a Dirichlet process is itself 
a distribution. Here, we assume that a galaxy group is a 
set of k components, ^2 =1 ^if(y\0i), with galaxy velocities 
distributed according to Gaussian distributions with mean 
and variance unknown. In this framework, the numbers m 
are the mixture coefficients that are drawn from a Dirich- 
let distribution. In the DPM model, the actual number of 
components k used to model data is not fixed, and can be 
automatically inferred from data using the usual Bayesian 
posterior inference framework. See Neal (2000) for a survey 
of MCMC inference procedures for DPM models. 

In this work, wc find k using the R language and en- 
vironment (R Develoment Core Team) under the dpmixsim 
library (da Silva 2009). The code implements mixture mod- 
els with normal structure (conjugate normal-normal DPM 
model). First, it finds the coefficients m, and then sepa- 
rates the components of the mixture, according to the most 
probable values of 7Tj, in the distributional space, leading to 
a partition of this space into regions (da Silva 2009). The 
results can be visually analysed by plotting the estimated 
kernel densities for the MCMC chains. In Figure 2, we show 
the DPM diagnostics for each group, that is, the deblended 
modes in the velocity distributions. We have found the fol- 
lowing number of modes per group: 4/102, 3/138, 3/177, 
2/179, 5/352, 3/447, 2/475, 3/505, 3/507. Therefore, all non- 
Gaussian groups in our sample are multimodal (reaching a 
total of 28 modes) according to the DPM analysis. Unfor- 
tunately, we cannot compute the physical properties of 13 
modes, due to intrinsic scattering in velocity data (and/or 
to the smallness of the modes - those with less than 4 mem- 
bers). The properties of the other 15 modes are presented 
in Table 2. 

3.3 The mass bias revisited 

Now, we perform again the statistical tests, comparing the 
distribution of mass in NG and G with the new sample 
(see Figure 3). First, we compare the NG subsample and 
the sample of modes (M). Both KS and CvM tests reject 
the hypothesis that M is distributed as NG, with p=0.0211 
and p=0.0189, respectively. Then, we compare G and M. 
Now, the tests accept the hypothesis that M is distributed 
as G, with p=0.4875 and p=0.4695. Hence, the distribu- 
tion of the modes deblended from non-Gaussian groups are 
themselves mass distributed as Gaussian groups. Also, the 
median mass of the M sample is (M 2 ^ ) = 1.33 x 10 14 Mq, 
a value larger than (M§) ) only by a factor ~ 1.5 (before 
deblending groups the factor was ~ 2.9). This consistency 
between G and M objects indicate that non-Gaussian groups 
are a set of smaller systems, probably forming an aggre- 



102 138 177 




24000 26000 28000 28000 31000 34000 34000 36000 38000 40000 

V (km/s) V (km/s) V (km/s) 



179 352 447 




35000 37000 22000 26000 30000 32000 34000 36000 

V (km/s) V (km/s) V (km/s) 



475 505 507 




32000 34000 36000 39000 41000 43000 41000 43000 

V (km/s) V (km/s) V (km/s) 



Figure 1. Histograms of radial velocities for all non-Gaussian 
groups. 
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Figure 2. DPM density probability decomposition. Velocities are 
rescaled to the [0,1] interval. 

gate out of equilibrium. Indeed, groups are the link between 
galaxies and larger structures. Thus, our results suggest we 
are witnessing secondary infall (the secondary modes) onto 
a previously formed (or still forming) galaxy system (the 
principal mode). Naturally, we cannot discard the possibil- 
ity of NG groups being unbound systems of smaller groups 
seen in projection, although the properties of the indepen- 
dent modes are quite similar to those found in physically 
bound groups. 
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Table 2. Main properties of individual modes 
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Figure 3. Histograms for Gaussian (dark gray) and non-Gaussian 
(white) groups. Histogram of modes is plotted in light gray. Den- 
sity probability comparison among systems. Intersections areas 
are in white. 

4 DISCUSSION 

We have classified galaxy systems after applying the AD 
normality test to their velocity distributions up to the out- 
ermost edge of the groups. The purpose was to to investi- 
gate regions where galaxy systems might not be in dynami- 
cal equilibrium. We have studied 57 galaxy groups selected 
from the 2PIGG catalog (Eke et al. 2004) using 2dF data out 
to 4i?200- This means we probe galaxy distributions near to 
the turnaround radius, thus probably taking into account all 
members in the infall pattern around the groups (e.g. Rines 
& Diaferio 2006; Cupani, Mezzetti & Mardirossian 2008). 
The corresponding velocity fields depend on the local den- 
sity of matter. High density regions should drive the forma- 
tion of virialized objects, whereas low density environments 
are more likely to present streaming motions, i.e., galaxies 
falling toward larger potential wells constantly increasing 



the amplitude of their clustering strength (e.g. Diaferio & 
Geller 1997). 

We have found that 84% of the sample is composed of 
systems with Gaussian velocity distributions. These systems 
could result from the collapse and virialization of high den- 
sity regions with not significant secondary infall. They could 
be groups surrounded by well organized infalling motions, 
possibly reaching virialization at larger radii. Theoretically, 
in regions outside ifeoo , we should apply the non-stationary 
Jeans formalism, leading us to the virial theorem with some 
correction terms. These are due to the infall velocity gradi- 
ent along the radial coordinate and to the acceleration of the 
mass accretion process (see Cupani 2008). Their contribu- 
tion is likely to be generally negligible in the halo core where 
the matter is set to virial equilibrium and to become signifi- 
cant in the halo outskirts where the matter is still accreting 
(see e.g. Cupani, Mezzetti & Mardirossian 2008). However, 
a slow and well organized infalling motion to the center of 
the groups could diminish the importance of the correction 
terms, i.e., the systems could be in a quasi-stationary state 
outside i?200 • The high fraction of groups immersed in such 
surroundings suggests that ordered infalling motion around 
galaxy groups might happen quite frequently in the Uni- 
verse. 

In any case, Gaussian groups can be considered as dy- 
namically more evolved systems (see Ribeiro, Lopes & Tre- 
visan 2010). The remaining 16% of the sample is composed 
of non-Gaussian groups. We found that these sytems have 
masses significantly larger than Gaussian groups. This bias- 
ing effect in virial masses is basically due to the higher ve- 
locity dispersions in NG groups. Ribeiro, Lopes & Trevisan 
(2010) found that NG groups have rising velocity dispersion 
profiles. A similar result was found by Hou et al. (2009). Ris- 
ing velocity dispersion profiles could be related to the higher 
fraction of blue galaxies in the outskirts of some galaxy sys- 
tems (see Ribeiro, Lopes & Trevisan 2010; and Popesso et 
al. 2007). 

At the same time, the NG subsample is composed of 
multipeak objects, identified by the DPM model analysis 
applied to the velocity space. These results indicate that sec- 
ondary infall might be biasing the mass estimates of these 
groups. Thus, the NG systems could result from the collapse 
of less dense regions with significant secondary infall input. 
Contrary to Gaussian groups, the surroundings of NG sys- 
tems do not seem to be in a quasi-stationary state. They are 
dynamically complex. Actually, we have found these groups 
can be modelled as assemblies of smaller units. After de- 
blending groups into a number of individual modes, we have 
verified that the mass distribution of these objects is con- 
sistent with that of Gaussian groups, suggesting that each 
unit is probably a galaxy group itself, a system formed dur- 
ing the streaming motion toward the potential wells of the 
field. 

Our results reinforce the idea of NG systems as complex 
structures in the phase space out to 4i?200- This scenario is 
illustrated in Figure 4, where we show the stacked G and NG 
groups, and the stacked modes. Galaxies in these compos- 
ite groups have distances and line-of-sight velocities with 
respect to the centers normalized by R200 and a, respec- 
tively. In the upper box, we present the G stacked group. 
Note that galaxies are extensively concentrated in the phase 
space diagram, with a single density peak near to 0.5i? 2 oo, 
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Galaxies from G groups 




Figure 4. Phase space diagrams for a typical Gaussian (upper 
box), non-Gaussian (middle box) and modes (lower box) systems. 
Density contours indicate galaxy concentration across the dia- 
grams. Distances to the center of the systems are normalized by 
R200- Radial velocities are subtracted from the median velocity 
and divided by the velocity dispersion of the groups. 



revealing a well organized system around this point. Also, 
note that the density peak has Vi os /a w 0. Contour density 
lines suggest that ordered shells of matter are moving toward 
the center. A different result is found for the NG stacked 
group. In the middle box of Figure 4, we see a less concen- 
trated galaxy distribution, with less tight density contour 
levels, presenting a density peak slightly larger than -R200, 
and two additional peaks, around 2i?200, possibly interact- 
ing with the central mode. The additional peaks are not 
aligned at Vi os /a « 0, suggesting a less symmetrical galaxy 
distribution in the phase space. These features suggest that 
non-Gaussian systems are distinct, and dynamically younger 
than Gaussian groups, which agrees well with the results 
of Ribeiro, Lopes & Trevisan (2010). Also in Figure 4, we 
present the modes stacked system in the lower box. Similar 
to the Gaussian case, we have a single peak in the phase 
space, near to 1.5i?200- Galaxy distribution however is still 
less symmetrical than in Gaussian groups, with the peak not 
aligned at Vi os /a « 0. This suggests that, although more 
organized than NG systems, modes are probabably dynam- 
ically distinct and younger than Gaussian groups as well. 

Our work points out the importance of studying NG 
systems both to possibly correct their mass estimates and 
multiplicity functions, as well as to better understand galaxy 
clustering at group scale. Understanding of these objects is 
also relevant for cosmological studies using groups and clus- 
ters through the evolution of the mass function (Voit 2005). 
Using systems with overestimated properties may lead to a 
larger scatter in the mass calibration (Lopes ct al. 2009b) 
and could also affect the mass function estimate (Voit 2005). 
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